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♦ ABSTRACT 



In comparison to automatic parallelization, which is thoroughly studied in the literature [31, 33], 
classical analyses and optimizations of explicitly parallel programs were more or less neglected. This 
may be due to the fact that naive adaptations of the sequential techniques fail [24], and their 
straightforward correct ones have unacceptable costs caused by the interleavings, which manifest the 
possible executions of a parallel program. Recently, however, we showed that unidirectional bitvector 
analyses can be performed for parallel programs as easily and as efficiently as for sequential ones 
[17], a necessary condition for the successful transfer of the classical optimizations to the parallel 
setting.In this article we focus on possible subsequent code motion transformations, which turn out 
to require much more care than originally conjectured [17]. Essentially, this is due to the fact that 
interleaving semantics, although being adequate for correctness considerations, fails when it comes 
to reasoning about efficiency of parallel programs. This deficiency, however, can be overcome by 
strengthening the specific treatment of synchronization points. 
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